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We analyze the problem of quantum-limited estimation of a stochastically varying phase of a 
continuous beam (rather than a pulse) of the electromagnetic field. We consider both nonadaptive 
and adaptive measurements, and both dyne detection (using a local oscillator) and interferometric 
detection. We take the phase variation to be V5 = ^/K,£^{t), where ^(f) is (5-correlated Gaussian noise. 
For a beam of power P, the important dimensionless parameter is A'^ = P/hujK, the number of 
photons per coherence time. For the case of dyne detection, both continuous-wave (cw) coherent 
beams and cw (broadband) squeezed beams are considered. For a coherent beam a simple feedback 
scheme gives good results, with a phase variance ~ N~^^'^ /2. This is \/2 times smaller than that 
achievable by nonadaptive (heterodyne) detection. For a squeezed beam a more accurate feedback 
scheme gives a variance scaling as N~^^^ , compared to N~^^^ for heterodyne detection. For the case 
of interferometry only a coherent input into one port is considered. The locally optimal feedback 
scheme is identified, and it is shown to give a variance scaling as N~^^^. It offers a significant 
improvement over nonadaptive interferometry only for TV of order unity. 

PACS numbers: 42.50.Dv, 42.50.Lc, 03.67.Hk 



I. INTRODUCTION 

The phase of an electromagnetic field is not a quantity 
that can be directly measured. All phase-measurement 
schemes rely on measurement of some other quantity, 
which necessarily introduces an excess uncertainty in the 
phase estimate. The standard method of measuring the 
phase of a single mode is to combine it with a strong 
local-oscillator field, which is detuned from the signal 
(so the phase changes linearly with respect to the sig- 
nal phase). This is called the heterodyne scheme, and 
introduces an excess uncertainty scaling as 1 /n, where n 
is the mean photon number. If tlie signal phase is known 
approximately beforehand, the introduced phase uncer- 
tainty can be reduced greatly by using a local-oscillator 
phase that is 7r/2 out of phase with the signal (homodyne 
measurements ) . 

If there is no estimate for the phase available before- 
hand, it is still possible to reduce the excess phase uncer- 
tainty by adjusting the local-oscillator phase during the 
measurement so as to approximate a homodyne measure- 
ment 1^, H, ^ . The mark II dyne measurements consid- 
ered in Refs. Q and pf introduce an excess phase uncer- 
tainty scaling as n~^'^. It is even possible to attain the 
theoretical limit, scaling as Inn/n^, using a more sophis- 
ticated feedback scheme [Q . 

The case of interferometry is quite similar. In interfer- 
ometry we wish to measure the phase shift in one arm 
of an interferometer by counting photons in the output 
ports. If a phase shift varying linearly in time is intro- 
duced into the other arm (analogous to the heterodyne 
case), there is a large introduced phase variance scaling 
as n~^. On the other hand, if feedback is used to adjust 
the auxiliary phase shift adaptively, the introduced phase 
variance is greatly reduced |q, |1. 



These studies are all based on single-shot measure- 
ments, where the measurements are made on a single 
(one- or two-mode) pulse with finite duration and a sin- 
gle fixed phase. In practice, if we wish to transmit infor- 
mation via a beam, a time-varying phase would be more 
convenient. A time- varying phase may also arise through 
random fluctuations, and we may wish to keep track of 
the phase as well as possible. 

It is also possible to model a broadband signal that 
carries information by random fluctuations. We therefore 
consider the case of a phase subject to white noise in 
this paper. We consider cw measurements for both dyne 
measurements and interferometry. For the former, we 
consider both coherent beams and broadband squeezed 
beams. For interferometry it is not clear if there is a cw 
analog to the optimal two-mode states derived in Refs. 
1^, |j. Therefore, we consider only the case of a coherent 
input into one port. 

II. ADAPTIVE DYNE MEASUREMENTS ON A 
COHERENT BEAM 

First, we will consider the case of cw dyne measure- 
ments on a single beam with a varying phase. It is 
simplest to consider a coherent beam with amplitude 
a = \a\ exp[iip{t)] having a constant magnitude, but 
varying phase. The magnitude is scaled so that jap is 
the photon flux (P/huj). As explained above, the phase 
is assumed to diffuse in time, 

tpit + dt)=tp{t) + y^dW'{t). (2.1) 

Here dW' is a Wiener increment satisfying (dW')'^ ~ dt. 
The spectrum for the coherent beam is a Lorentzian of 
linewidth (full width at half maximum) k. 
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As in the single-shot quadrature of the field is 

measured by combining the mode to be measured with 
a large-amplitude local-oscillator field at a 50:50 beam 
splitter and measuring the outputs with photodetectors. 
The photocurrent is then defined by 



SN+ - SN- 
l[t) = hm hm — 

(5t^O/3^oo i36t 



(2.2) 



where SN-^- and 6N^ are the outputs from the photode- 
tectors and f] is the local-oscillator amplitude. For a 
continuous coherent beam this yields 



I{t)dt = 2Re(ae~'*(*))di + dW{t) 



(2.3) 



where $(t) is the phase of the local oscillator, and dW{t) 
is a Wiener increment independent of dW'(t). 

In making adaptive phase measurements the phase of 
the local oscillator is usually taken to be $(t) = (p{t) + 
7r/2, where (p{t) is some estimate of the system phase 
ip{t) 0. With this, the signal becomes 



I{t)dt = 2\a\ sin [ip{t) - ifi{t)] dt + dW{t). 



A. Linear Approximation 



(2.4) 



Provided that the estimated system phase is suffi- 
ciently close to the actual system phase, we can make 
the linear approximation 

I{t)dt = 2\a\[ip{t) - ip{t)]dt + dW{t). (2.5) 

Rearranging this equation, we see that 



e{t)^m+Iit)/2\a\ 



(2.6) 



is an unbiased estimator of (p(t) based on the data ob- 
tained in the infinitesimal time interval [t,t + dt). We 
will denote the best phase estimate based on all the data 
up to time t by 0(i). Note that this is the best phase 
estimate, in contrast to the phase estimate used in the 
feedback (p{t). The variance of each phase estimate 9{t) 
is given by 



[0{t) - ip{t)]' 



(dW{t)Y 
\2\a\dt J 



1 



Wdf 



(2.7) 



Here the simple definition of the variance has been used, 
rather than the Holevo phase variance H 



VH{e)^\{e 



ie\|-2 



- 1, 



(2.8) 



as in Refs. j|, ||, ^, ||, ^. This is because we are using 
the linear approximation. 

The noise in the estimate 9{t) is due entirely to the 
photocurrent noise, rather than the noise in the phase tp 
itself. Since dW{t) is independent of all previous noise, 
the updated best estimate Q{t + dt) will be a weighted 
average of the instantaneous phase estimate 6(t) and the 
estimate from all the previous data 0(i). 



The equilibrium value of the variance of Q(t), with all 
the individual phase estimates correctly weighted, will 
be denoted by AO'^. From Eq. (2.1), after a time dt the 
phase variance of &{t) with respect to the new system 
phase (p{t + dt) will be A8^ -I- ndt. The variance in the 
phase estimate from the latest time interval, 0(t), will be 
given by Eq. (2.7). 

If we take a weighted average of 0(i) a,nd9{t), then the 
contributions from each of the phase estimates from the 
individual time intervals should be correctly weighted, 
and the variance in the weighted average should be the 
equilibrium value, AQ^. This implies that 

Solving for A9^ gives A9^ = v^/2|a|. If we define 

iV=|a|VK, (2.10) 

the number of photons per coherence time (or photon 
flux divided by lincwidth), we have 

Ae^ 



l/2ViV. (2.11) 

This is the square root of the analogous result 1 /4n for 
a single-shot adaptive measurement on a coherent pulse 
of mean photon number n. 

Explicitly, the weighted average is 

{4\a\'^dt)e{t) + e(i)/(Ae2 + Kdt) 



^i^ + dt)^ 1/Ae^ 

Solving this as a differential equation gives 



(2.12) 



e{t)^2\a\V^ 6'(s)e2|"lv^(''-*)ds. (2.13) 



Therefore, this method corresponds to a simple negative 
exponential scaling of the weighting. 

We can also consider a more general negative exponen- 
tial scaling given by 

ft 



3)e'^("-*)ds. 



(2.14) 



Note that with this more general scaling, 8(<) is no longer 
necessarily the best phase estimate. For most of the re- 
mainder of this paper, &{t) will be used in this more 
general sense, rather than as specifically the best phase 
estimate. The best phase estimate will be found by find- 
ing the optimum value of %. Taking the derivative of this 
expression with respect to time gives 

e{t + dt) = xdte{t) + (1 - xdt)Q{t). (2.15) 

This means that this method is again a weighted average, 
except with a weighting that is not optimum. If we find 
the variance of both sides of this equation and solve for 
A6^ we obtain 

X 1^ 
8|^+2^- 

This equation has a minimmn of AQ^ — ^/K,l2\a\ for 
X = 2|q;|y^, reproducing the result foimd more directly 
above. 



Ae^ = 



(2.16) 
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B. Exact treatment 

The results of the pre vious section are all using the lin- 
ear approximation (2.5). Although this approximation is 
very useful for obtaining the asymptotic value of the vari- 
ance, it does not directly tell us what to do in the exact 
case. In the exact case for single-shot measurements 
rather than averaging phase estimates from each time in- 
terval, we determine Ay and By , defined (for scaled time 
V e [0, 1]) as 

e''^I{u)du, By^- e^'^du, (2.17) 
Jo Jq 



and obtain the phase estimate from 

e{v) ^ SiTg{vAy + ByAl) . 



(2.18) 



The intermediate phase estimate in the simplest (mark 
II) case [|j was 



ip{v) = argAi,. 



(2.19) 



We seek cw analogues of these formulas, that should 
reproduce the above linearized results in the appropriate 
(large TV) regime. Guided by Sec. II A, we replace the 
definitions of Ay and By by 



At = 



(2.20) 



it = / e-" 

^ — oo 

Bt^- f ex("-*)e2**du, (2.21) 

and continue to use arg A-t as the intermediate phase esti- 
mate (p{t). We will not consider any better intermediate 
phase estimates here, as these only give very small im- 
provements over the mark II case for coherent states. 

To find a formula for 0(i), we can use a similar ap- 
proach to that used in Ref . Q . Le t us ignore the vari- 
ation of the system phase in Eq. (2.2C). Since we ex- 
pect from Sec. II A that for large N the optimal x is 
0(|a|Y^) = 0{kaIn) 3> K, this is a reasonable approxi- 
mation. Then we find 



At = a/x - a*Bt + icrt, 



where 



gX(«-t)g»(*-'r/2) 



dW{u). 



(2.22) 



(2.23) 



Equation (2.22) is analogous to the corresponding result 
1^ for the case of single-shot measurements, except with 
V replaced with 1/x- Note that from this derivation it 
naturally emerges that we should use the same exponen- 
tial in the integrand for Bt as for Af. From Eq. ( ^.22| ) it 
can be shown that 

At + xBtAl = a(l/x - x\Bt?) + ^^t - ixBto^. (2.24) 



Taking the expectation value gives 

i,At^-xBtA\)^a{\lx-x\BtW 



(2.25) 



If the local oscillator phase is independent of the pho- 
tocurrent record, then this is exact. In the case of feed- 
back, Bt may be correlated with at , but this result should 
still be approximately true. Therefore, the phase esti- 
mate that will be used here is 



Q(i) = 'a.Yg{At+xBtAr, 



(2.26) 



Similarly to the single-shot case Q], we will define the 
variable Ct = At^ xBtA^, so Q{t) = argCj. The above 
derivation is not exact if the system phase is not constant; 
however, argC* should still be a good estimator for the 
phase in the semiclassical limit. 

A differential equation for the feedback phase can be 
determined in a similar way as in Ref. |]. Using Eq. 
( p. 20 ), we can determine the increment in At, 



dAt = e''^I{t)dt - xAtdt. 



(2.27) 



Taking the local oscillator phase to be ^{t) — arg At 
7r/2, we find that 



dAt = i^Iit)dt - xAtdt, 
so the magnitude of At varies as 



(2.28) 



d\At\^ = a; (dAt) + (dA*) At + (dA;) (dAt) 

^ {l-2x\At\^)dt. (2.29) 

Thus I At I increases up to an equilibrium value given by 



\At 



l/2x- 



Using this result, the increment in the feedback phase 
in the steady state is 



d$(t) = lm[dlnAt] 

= im — 

At 

_ I{t)dt 

~1M 



{dAtf 
2A? 



= ^2xl{t)dt. 



(2.30) 



Therefore, the feedback phase just changes linearly with 
the signal, with constant coefficient (rather than a time- 
dependent coefficient as in the pulsed case [^). 

Using this result gives the stochastic differential equa- 
tion for the phase estimate (p{t) as 



d^lt) = V2x{2|a| sm[Lp{t) - ip{t)]dt + dW{t)}. (2.31) 
Making a linear approximation gives 

difit) = V2x{2|a| [^{t) - ip{t)]dt + dW(t)}. (2.32) 
Rearranging and integrating then gives the solution as 

g2|a|V2^(«-t) [2\a\ip{u)du + dW{u)] . 

(2.33) 



m - v2x 
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If the phase is measured relative to the current system 
phase, then 



ip{u) 



K / dW'{s). 



(2.34) 



To determine an expression for the phase estimate 
note that it can be simphfied to 



e(t)-^(t)+arg(l + xe"'^^^*)i?t). 



(2.35) 



Using Eq. ( ^.21 ) and expanding the exponentials to first 
order gives 

e{t) « (p{t) + arg ^1 - iip(t) +ixj e'^("-*)^(M)du^ 
« X / <^(u)e'^("~*)dw. (2.36) 



This demonstrates that the mark II phase estimate is ap- 
proximately a weighted average of the intermediate phase 
estimates, just as in the pulsed case it is approximately 
an unweighted average |Q . Note also the similarity of this 
result to the result for the linear case (2.14). Unfortu- 
nately the simple technique used in the linear case cannot 
be applied here. However, using the standard techniques 
of stochastic calculus, the expectation value (0^(i)) can 
be determined from Eq. (2.36), in a lengthy but straight- 
forward calculation. The result is exactly the same as 



that obtained using the linear approximation (2.16) 



III. 



HETERODYNE MEASUREMENTS ON A 
COHERENT BEAM 



In order to determine how much of an improvement 
feedback gives for cw measurements, we will compare it 
with the case of cw heterodyne measurements. For het- 
erodyne measurements on a pulsed coherent state, the 
introduced phase variance is equal to the intrinsic phase 
variance. This indicates that the first term in Eq. ( 2.1t ) 
should be double for the heterodyne case, so the phase 
variance is 



X 



K 

2^' 



(3.1) 



We now show this more rigorously using a similar tech- 
nique to that used in Ref. Expanding At gives 

At = f ex("-*)e**(")[(ae~** + a*e**)dw + dW{u)]. 

(3.2) 

For the heterodyne case, the local oscillator phase ^{t) 
varies very rapidly, so the second term above will be neg- 
ligible. This means that At simplifies to 

At^\a\ f e^("-*)e''^(")dM + i(Tt. (3.3) 



Since Bt is also negligible, the phase estimate Q{t) sim- 
plifies to Q{t) = arg At. As above, the phase will be 
measured relative to the current system phase. In the 
limit N ^ 1, the system phase does not vary signifi- 
cantly during the time 1/X; so we can take the linear 
approximation, giving 

At « 1^ -|-z|a| /" eX("-*V(")rf" + «CTt. (3.4) 

X J —oo 

Using this, the phase estimate is 



e(t) « Im 



(3.5) 



Here the linear approximation has again been used. Fur- 
ther evaluating this gives 

Q{t) = -V^.X f due^^^-'^ f dW'{s) + ^AGt+<Tl). 

(3.6) 



The variance is, therefore, 



{e\t)) = KxU dui 



X / dW'{si) I dW'{s2) 



(3.7) 



The first term here can be evaluated to give k./2x- In 
addition, it is easy to show that (ctj) ~ and (|crtP) = 
1/2%. Using these results gives the variance as 



2x 4|a|2- 



(3.8) 



This shows that Eq. ( |3.1| ) is correct. Using this result, the 
minimum variance is ^JUj \p2\oc\ for x = '\/2K|a|. In terms 
of N ^ this is l/^/2N , which is \/2 times the minimum 
phase variance for the adaptive case. 



IV. RESULTS FOR DYNE MEASUREMENTS 
ON A COHERENT BEAM 

In order to verify the above analytical results, the equi- 
librium phase variance was determined numerically for a 
variety of parameters. Because we do not presuppose a 
value for x, there are two dimensionless parameters in 
our simulations, 



N 



K 



X 



X 

lap 



(4.1) 



From the above theory, the optimum value of X is 2/vjV 
for the adaptive case and \J2/N for the heterodyne case. 

The value of N was varied from 1 up to 2.5 x 10"^^. 
For each value of iV, X was varied from a quarter to four 
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FIG. 1: The phase variance for cw adaptive measurements for 
X — 2/y/N. The numerical results are shown as crosses and 
the theoretical values of l/y/2N are shown as the continuous 
line. The inset shows the ratio of the minimum phase variance 
for cw adaptive measurements to the minimum phase variance 
for cw heterodyne phase measurements. 




FIG. 2; The phase variance as a function of X for A*' — 10^. 
The numerical results for adaptive and heterodyne measure- 
ments are shown as the crosses and pluses, respectively, and 
the theoretical results for adaptive and heterodyne measure- 
ments are shown as the continuous line and dotted line, re- 
spectively. 



times 2/^/N. Measuring time in units of |q!|~^, the time 
steps used were At = 1/10'^X. For these calculations 
1024 simultaneous integrations were performed and the 
variance was sampled repeatedly. The integrations were 
taken up to time 10/X, in order for the variance to reach 
its equihbrium value, then the variance was sampled at 
time intervals of 1/X up until time 100/X. 

The results for X = 2/VN are plotted in Fig. |l|. The 
variances for iV = 1 to 4 x 10^^ are the Holevo variances, 
and for above 4 x 10^^ are the standard variances. As 
can be seen, the results are very close to the theoretical 
values. To show the improvement over heterodyne mea- 
surements, the ratio of the minimum phase variance for 
adaptive measurements to the minimum phase variance 
for heterodyne measurements (with X = y^2/N) is plot- 
ted in the inset of Fig. |l|. The ratio is close to 1 for small 
N, but for larger N the ratio gets closer and closer to 
1/V2. 

Alternatively we can plot the phase variance as a func- 
tion of X for fixed N. In Fig. ^ we have shown the phase 
variance as a function of X for N — 10^, for adaptive and 
heterodyne measurements. The numerical results agree 
reasonably closely with the theoretical values, although 
there is a noticeable difference for adaptive measurements 
for the larger values of X. Note that the minimum phase 
variance for adaptive measurements is at X = 
and the minimum phase variance for heterodyne mea- 
surements is larger and at a smaller value of X. When 
the value of N is increased further, the numerical results 
agree even more closely with the theoretical values. 



V. ADAPTIVE DYNE MEASUREMENTS ON A 
BROADBAND SQUEEZED BEAM 

The above results show that the improvement offered 
by adaptive measurements over nonadaptive (hetero- 
dyne) measurements in the case of a coherent beam is 
only a factor of l/\/2 reduction in the variance. This is 
similar to the single-shot case, where a 1/2 reduction was 
found for the coherent case. However, in the single-shot 
case a far more dramatic reduction is found for the case 
of a squeezed state. Motivated by this we now consider 
adaptive dyne measurements on a cw squeezed beam. 

It is simplest to consider broadband squeezing. Physi- 
cally, this could arise as the output of a driven parametric 
oscillator in the limit that the decay time of the cavity 
is much shorter than any other relevant timcscales f^. 
This resu lts in the modification of the photocurrcnt from 
Eq. (O) to 



I{t)dt = 2Re(ae"**(*))dt + dW{t) 

X cos2($ - (j)j2) + e2'^ sin2($ - (j,j2), 

(5.1) 

where a is the amplitude of the squeezed beam, and r 
and 0^ are the magnitude and phase of the squeezing, 
respectively. In this idealized limit the noise reduction 
via squeezing occurs by a reduction in the shot noise level, 
rather than an anticorrelation between the shot noise and 
the later coherent amplitude (as in the single-shot case). 

For reduced phase uncertainty, the phase of the squeez- 
ing should be (f>i^ = 2ip + tt, where ip is the system phase. 
If we are using feedback given by $ = if + tt/2, where ip 
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is an estimate of the phase, then the photocurrent can 
be expressed as 

I{t)dt = 2|a|sin(i^- ip)dt 



+ dW{t)\J e-"^'' cos^ {if -</') + ' 



-ip). 
(5.2) 



It is clear that if the intermediate phase estimate used 
is very close to the system phase, then the factor mul- 
tiplying dW will be close to and will be at a min- 
imum. The better the intermediate phase estimate is, 
the smaller this multiplying factor will be. If the inter- 
mediate phase estimate is not perfect, it is clear that 
increasing the squeezing past a certain level will not re- 
duce the multiplying factor. This is because the e^^ term 
will start to dominate. 

It is possible to estimate the optimum squeezing and 
the minimum phase variance using the linear approxima- 
tion. In this approximation, the variance in the individ- 
ual phase estimates 9{t) is 

[e-^'' cos^ifi -ip) + e^'' sin^iifi - ip)]/4\afdt. (5.3) 

It is clear that the minimum phase variance (in this ap- 
proximation) will be obtained when the best phase esti- 
mates are used for ip. It is therefore reasonable to use the 
phase estimates Q{t) for ip, rather than arg At as in the 
coherent case. The values of 0(i) will be the best phase 
estimates when the correct x is used. As the variance of 
these estimates is AQ^, we obtain 



2r ■ 2 

e sm 



(<^ -</?)) we-^^ + e'^Ae^. 



(e cos^((^ — If) + 

(5.4) 

This approximation will be true for small phase variances 
and large squeezing. Following the same derivation as for 
the coherent case, the only difference is the multiplying 
factor, so we obtain 



X 



-2r 



„2r 



Ae^ 



(5.5) 



This expression has two independent variables, x ^-nd 
r, that can be varied in order to find th e mi nimum phase 
variance. Taking the derivative of Eq. ( ^.5|) with respect 
to X &iid setting the result to zero gives 



X 



Ae2' 



Substituting this into Eq. (5.5) gives 



AG^ 



4|a|' 



2r 



Ae2 



(5.6) 



(5.7) 



Taking the derivative of this with respect to r and again 
setting the result equal to zero gives 

e-^'- = Ae^. (5.8) 

Substituting this back into Eq. ( ^.7D gives the phase vari- 
ance as 

2/3 / 1 \ 2/3 

Ae^ = ■ ■ " 



2|aP 



1 

27V 



(5.9) 



Thus we see that even for an arbitrarily squeezed beam, 
the best scaling we can obtain for the phase variance is 
N~'^/^ , as compared to N^^/"^ for a coherent beam. This 
difference is less than for pulsed measurements, where 
the phase variance for the optimum squeezed states scales 
almost as n~^, as compared to for coherent states. 



VI. HETERODYNE MEASUREMENTS ON A 
BROADBAND SQUEEZED BEAM 

In order to determine the phase variance for hetero- 
dyne measurements on a squee zed beam, we can simply 
perform the derivation of Sec. Ill, except with the fac- 
tor multiplying dW from Eq. (5.1) included. This means 
that the variance will be 



2x 4|a|2 
except with at modified to 



(6.1) 



X y^e-^'- sin^($ - Lp) + e^'' cos^{<^ - if>)dW{u). 

(6.2) 

Here we have used the assumption that the phase of the 
squeezing is 2tp + tt. Note that the derivation of Sec. 
Ill takes the phase relative to the current system phase. 
This means that to a first approximation we may take 
ip{u) = 0. 

In order to determine the phase variance, we must de- 
termine the expectation values (IcrtP) and (cr|). We find 



(|atp) = r e2x(«-t) (e-2'-sin2$ + e2-cos2$)du. 

J — OO 

(6.3) 

As the local oscillator phase $ is varying rapidly in the 
heterodyne case, we may take the average values of sin^ 
and cos^, giving 



cosh(2r) 



2X 



(6.4) 



Similarly, evaluating (cr^) gives 



(a?) = - r e^^("-*)e2'* {e~^^ sin^ $ + e^^ cos^ $) du. 

J — OO 

(6.5) 

Taking trigonometric averages as above gives 
2\ sinh(2r) 



4X 



(6.6) 



Using these results we obtain the phase variance as 

(6.7) 



(92 (i)) ^ — + cosh(2r) - i sinh(2r) 



2X 



4|a|Vx 
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This differs from the result for the coherent case by 
the multiplying term cosh(2r) — isinh(2r). This has a 

minimum of ■\/3/2 for r = ln(3)/4. Using this value, 
we obtain the minimum variance as i^/'^y/n/ {2a) for 
X = 2-y/K|Q;|/3^/^. Thus we find that the scahng is the 
same as for a coherent beam, and the multiplying factor 
is only about 7% smaller. In contrast there is a factor of 
two difference in the single-shot case. 



VII. RESULTS FOR DYNE MEASUREMENTS 
ON A BROADBAND SQUEEZED BEAM 

The results for the cw squeezed beam were obtained by 
a similar method as for the cohe rent case. Only variation 
in the variables N and X of Eq. (4.1) was considered, and 
time was measured in units of |a|~^. The step sizes used 
were Ai = 1/lO^X. The integrations were taken up to 
time 30/X, then the variance was sampled every time 
step until time 130/X. The integration was performed 
using the photocurrent given in Eq. ( 5T ) with 4>q = 2lp + 

IT. 

It was found that when ip{t) — arg Ct was used in the 
feedback, very poor results were obtained. This is a simi- 
lar result to the case for single-shot measurements, where 
using arg Cy feedback results in large phase variances Q . 
This is because, when the intermediate phase estimates 
are extremely good, the results do not distinguish eas- 
ily between the real system phase and the system phase 
plus TT. This means that many of the results are out by 
TT, resulting in a large overall phase variance. 

In order to avoid this problem, rather than using arg Ct 
in the feedback, an intermediate phase estimate given by 



(7.1) 



was used, with e constant. Note that this is the same as 
used to obtain phase measurements close to optimum in 
the single-shot case, except that there a time-varying e 
was used. 

For each value of N there are three variables that can 
be altered to minimize the phase variance: X, r, and s. It 
is not calculationally feasible to consider a range of values 
for all three variables. Instead, different values were tried 
systematically to find the minimum phase variance. 

The minimum phase variances obtained by this 
method are plotted as a functio n of N in Fig. ^j. The 
theoretical values given by Eq. (5.9) are also shown in 
this figure. The numerical results are higher than the 
theoretical values, but they have the same scaling with 
N, namely, N^^^^. If we plot the ratio of the numerical 
results to the theoretical values as in the inset of Fig. ||, 
we find that for the largest values of N the ratio levels 
ofi^ at about 2.6. 

Now note that, from Eqs. (pT§) and (5^), the optimum 
valu e of e" ^*" s hould be {2N)^^^/^. Similarly, from Eqs. 
( ^ and (U), the optimum value of X is 
The numerically obtained optimum values of e"^"" and 
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FIG. 3; The phase variance as a function of A'' for a cw 
squeezed beam. The theoretical relations for adaptive and 
heterodyne measurements are shown as the continuous line 
and dashed line respectively, and the numerical results for 
adaptive and heterodyne measurements are shown as the 
crosses and pluses respectively. The inset shows the ratio 
of the numerically obtained phase variance to the theoretical 
value as a function of A'' for adaptive measurements. 




FIG. 4: The optimum values of e~ ^ , X, and e for measure- 
ments on a cw squeezed beam. The numerically found values 
of e"'^'' are plotted as crosses, and the theoretical expression 
as a continuous line. The numerically found values of X are 
plotted as pluses, and the theoretical expression as a dashed 
line. The numerically found values of e are plotted as circles, 
and the dotted line is the expression fitted to the data. 



X, as well as these theoretical expressions, are plotted 
in Fig. H Similarly to the case for the phase variance, 
the scaling is the same as theoretically predicted, but the 
scaling constants are different. For the case of e"^'', the 
optimum values are about eight times those theoretically 
predicted, whereas the values of X are around a third of 
those theoretically predicted. 

For the case of e there is no theoretical prediction for 
the optimum value. The numerically obtained values are 



8 



la>_^ 




FIG. 5: The Mach-Zehnder interferometer, with the addition 
of a controllable phase $ in one arm. The unknown phase to 
be estimated is tp. Both beam splitters are 50:50. 



shown in Fig. ^, and as can be seen e decreases in a 
regular way with increasing TV. A power law was fitted 
to these values (for > 1), and the power found was 
—0.35 ± 0.01. This is very similar to the N~^/'^ scaling 
found for e"^'' and X. 

The results for heterodyne measurements are also 
shown in Fig. ^. The results in this case agree very accu- 
rately with the theoretical prediction, within about 0.5% 
for the larger values of N . Similarly the optimum values 
of r and x agree very accurately with those predicted 
above. The variance scales as iV~^/^, in contrast to the 
variance for adaptive measurements that scales as N~^/^. 
This means that the improvement in using adaptive mea- 
surements scales as A^~^/^, which can be very large for 
large N. 



VIII. CW INTERFEROMETRY 

Now we will turn from dyne measurement on a single 
beam to cw interferometric measurements. In this case 
we have a Mach-Zehnder interferometer (MZI), and are 
attempting to continuously track a stochastically varying 
phase in one arm, by controlling the phase in the other 
arm and detecting photons in the two output beams. 
This is shown in Fig. ||. In this context it is not possible 
to consider nonclassical states of the type considered for 
the single-shot case Instead, for simplicity, we will re- 
strict our consideration to the case where all photons en- 
ter through one port. This can be realized using coherent 
light, with |ap photons per unit time. Note that because 
this is an interferometric measurement rather than one 
using a local oscillator as a phase reference, the phase of 
a is irrelevant. 

This case is essentially semiclassical, and the detections 
can be considered independently. Therefore, consider the 
state with a single photon incident on port a. The an- 



nihilation operators for the output modes of the MZI, cq 
and ci, are related to the annihilation operators for the 
input modes, d and 6, by |^ 

Cu =asin[((^-$-)-u7r)/2]+Scos[(<^-<I>-fu7r)/2], (8.1) 

for u E {0, 1}. Hence the probability for detecting the 
photon in detector u is given by 



sin^ [{ip - $ + u7r)/2]. 



(8.2) 



Using Bayes' theorem, the probability distribution for the 
system phase after the detection is proportional to this 
probability times the initial probability distribution. 

Denote the results for m such detections by the string 
TT-rn = UmUm-i ■ ■ ■ Ui- The probability distribution for 
the phase given rim, P{'^\nm), can be expressed as 



P((p|n„ 



.3) 



In the absence of any phase variation, it can be 
shown from Eq. ( |8.2| ) that the unnormalizcd coefficients 
Pm-k^^rn) Can be determined by 

^'m-fc("™)=/'m-i;fc(n™_i)-ie-'(*™-"'"")P™_i;fc_i(n™_i) 



2^ 



m--l;k+l 



{nm-l)- 



(8.4) 



The normalization condition on the probability distribu- 
tion becomes Pm-fiinm) = 1- The normalized probability 
distribution can be obtained by simply dividing the co- 



efficients obtained from Eq. (8^) by Pm-oi'^^m) 



Similarly to the case of dyne measurements, we will 
assu me that the system phase diffuses with time as in Eq. 
( ^■l[ ). When the phase varies in time, the time between 
detections is important. For a photon flux of the 
probability of a photodetection in time dt is japdt. The 
probability distribution for the time between detections 
is given by the exponential distribution 



PE{t)dt 



\a\'e- 



dt. 



(8.5) 



In the results that will be presented here, the time be- 
tween detections. At, was determined according to this 
probability distribution. 

Now in order to determine the effect of this phase dif- 
fusion on the probability distribution between detections, 
we must first consider the effect over some very small time 
interval St. This is necessary because the probability dis- 
tribution for the change in the system phase over time 
At does not go to zero for Aip = zLtt. This means that 
the probability distribution will not be exactly Gaussian, 
due to the overlap. In contrast, if we look at a very small 
time interval St, the change in the phase will have a nor- 
mal distribution with a variance of uSt. Explicitly the 
probability distribution is 



PGiAip)diAip) 



1 



y2TTK5t 



-A<p=^/(2K5t)rf(A<^). (8.6) 
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The probability distribution for the phase after time 
St will be the convolution of the initial probability distri- 
bution with the Gaussian described by Eq. (|8.6|). Evalu- 
ating this convolution gives 



Pi^ - e\n^)PGi0)d0 



m/2 



k=-m/2 



-ike 



(8.7) 

As 5t is assumed to be small, nSt ^ 1, and the integral 
in Eq. (|^) evaluates to e"*^ «;5t/2_ ^he effect of the vari- 
ation of the system phase on the probability distribution 
is, therefore, 

Pt,kM = Pm:kMe~'''''^''^. (8.8) 

In order to take account of the effect of the phase dif- 
fusion on the probability distribution over some signifi- 
cant time interval At, this time interval can be thought 
of as comprising M small time intervals 5t. Then we 
find that the coefficients are just multiplied by M terms 
of e^^ This is equivalent to a single term of 

g-fc which is very easy to i mple ment. 

As time passes the effect of Eq. 



4 ) is to broaden the 



distribution of probability coefficients in fc, correspond- 
ing to a smaller variance in the phase distribution. In 
contrast, the Gaussian term in Eq. (B.8|) tends to narrow 



the distribution of probability coefficients, corresponding 
to a greater phase variance. The initially broad phase 
distribution narrows until an approximate equilibrium is 
reached, where the two effects cancel each other out. 

In Ref. 1^ it was shown that the optimum phase esti- 
mate for the single-shot case is 

e = arg(e''^) = argF„,._i(n„). (8.9) 

It is easy to see that this phase estimate is optimal in 
the cw case also. In addition we consider feedback that 
is equivalent to that considered in the single-shot case 
in Ref. [^. Rather than using an intermediate phase 
estimate as in the dyne case, we use the full power of 
Bayesian statistics to choose the feedback phase $ so 
as to minimize the expected Holevo phase variance after 
the next detection. This is achieved by choosing <I>„i to 
minimize the value of [|, 



M($, 



E 



PM^)e'^d^ 



(8.10) 



The values of P{nra\^p) can be obtained, except for a 
normalizing cons tant that is common to u„i = and 
1, by using Eq. (B.4). This means that we can express 
M(<I>m) as in Ref. |3| with the parameters a, 6, and c 
given by 



a = ^?ri-l;-l("-m-l), 
^ — 5-Pm-l:-2('T-m-l)i 
C = \Pm-l-fi{nm-l) ■ 



(8.11) 



These values of a, 6, and c can be used to determine the 
feedback phase as in Ref. |^ . 

The phase uncertainty at equilibrium can be estimated 
using a similar approach as was used for the single-mode 
case. Let us assume that the equilibrium variance in the 
best estimate for the system phase is A9^. After time 
At, the variance in this phase estimate with respect to 
the new system phase, Lp{t -f At), will be A8^ -f AtAt. In 
the equilibrium case this increase in the variance should, 
on average, be balanced by the decrease due to the next 
detection. 

We now wish to estimate the equilibrium variance 
based on a weighted average with the previous best phase 
estimate, and a phase estimate from the new detection. 
If we use the actual variance for a phase estimate based 
on a single detection, then we do not get accurate results. 
This is because the variance for a single detection is large, 
so the weighted average does not accurately correspond 
to the exact theory. In order to make the theory based 
on weighted averages accurate, we need to assume an ef- 
fective variance for the single detection, that is different 
from the actual variance. 

In the case where there is no variation in the system 
phase, the phase variance after n detections is approxi- 
mately 1/n Q. It is clear that, if we assume that each 
detection has an effective variance of 1, then we will ob- 
tain the correct result. This is, in fact, equal to the 
variance as estimated using (2(1 — cosip)) (this measure 
is used, for example, in Ref. |l^). Applying this to the 
case with a varying system phase gives 



Ae2 + kM 



1 



1 



Ae2' 



.12) 



Simplifying this to solve for AO^ , we find AG'^ « V wAt. 
On average, the time between detections is l/jap, so the 
approximate value of the variance should be 



Ae^ « i/^V|aF = 



.13) 



IX. RESULTS FOR CW INTERFEROMETRY 

In order to verify this theoretical result, the equilib- 
rium phase variance was determined numerically for a 
variety of parameters. In this case there is only one di- 
mensionless parameter, N . In the case of dyne measure- 
ments there was the additional parameter X describing 
how the latest results were weighted as compared to the 
previous results. In this case we do not have this param- 
eter, as the phase estimates are not determined in that 
way. 

The calculations were run for 10^ detections (or 2 x 10^ 
for the maximum value of N)^ and the phase error was 
sampled every detection after lOVlV detections. This 
was done 100 times for each value of N . The equilibrium 
phase variance was determined in this way for the nearly 
optimum feedback scheme described above. In addition 
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we tested a nonadaptive measurement scheme with 

= $0 + m7r/\/7V, (9.1) 

where $o is a random initial phase. When the value of 
N was 1 or less this was modified to 

$„ = $o + m7r/2, (9.2) 

to prevent $m being constant (modulo tt) . This is equiv- 
alent to the nonadaptive scheme in the single-shot case 
used in Ref. ||] , and is analogous to heterodyne measure- 
ment. The reason for the factor of 1/y/N is that the 
effective number of detections used for the phase esti- 
mate is ViV. This follows from the fact that the phase 
variance is approximately 

A minor problem with cw adaptive measurements is 
that the number of probability coefhcients Pm-k{nm) 
needed to determine the probability distribution for the 
phase rises indefinitely with the number of detections. 
The narrowing effect of the varying system phase, how- 
ever, means that the probability coefficients fall approx- 
imately exponentially with k. The probability distribu- 
tion can, therefore, be approximated very accurately by 
keeping only a certain number of coefficients. For the 
results presented here all probability coefficients with a 
magnitude above about 10~^° were used. 

The Holevo phase variances for the two measurement 
schemes are plotted in Fig. ^ As can be seen, the results 
for both cases are very close to the theoretical result of 
1/Vn for the larger values of N. For values of N closer 
to 1 the results for the nonadaptive scheme are noticeably 
above the theoretical values. For small values of N (less 
than 1), the variance converges to 3 for both the feed- 
back schemes. This is what can be expected, as the sys- 
tem phase is randomized between detections. This means 
that the measurements are equivalent to phase measure- 
ments with a single photon, for which the Holevo phase 
variance is 3. The feedback has no effect, as there is no 
information on which to base it. 

To see the differences more clearly, the phase variances 
are plotted as ratios to the theoretical values in the in- 
set of Fig. ^. The adaptive scheme gives phase variances 
that are very close to, and slightly below, the theoret- 
ical values for moderate values of N. In contrast, the 
results for nonadaptive measurements are all above the 
theoretical values (for iV > 1). For small values of N the 
variance for both schemes is below 1 /VlV, as the variance 
is converging to 3. 

These results show that there will be a significant im- 
provement in using an adaptive scheme over a nonadap- 
tive scheme only if the time scale for the system phase 
variation is comparable to the time between detections. 
This can be expected from the results for the single-shot 
case with all photons in one port, where there was a sig- 
nificant improvement in using an adaptive scheme only if 
the photon number was small. The maximum improve- 
ment here is about 24% for iV « 4. 
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FIG. 6: The phase variance as a function of A'^. The numerical 
results for adaptive and nonadaptive measurements are shown 
as the crosses and pluses, respectively, and the theoretical 
values are shown as the continuous line. The inset shows the 
phase variance as a ratio to the theoretical value of 1/y/N. 
The results for adaptive and nonadaptive measurements are 
shown as the continuous line and dotted line, respectively. 



X. CONCLUSIONS 

This study considered the problem of cw phase mea- 
surements, where the phase is being varied randomly in 
time and the aim is to follow this variation with the min- 
imum possible excess uncertainty. We considered three 
different situations: dyne measurements on a coherent 
beam, dyne measurements on a (broadband) squeezed 
beam, and interferometric measurements using a coher- 
ent beam input. The relevant dimensionless parameter is 
the number of photons per coherence time (the char- 
acteristic time for the phase diffusion). Under optimum 
conditions, we found the analytical results, confirmed nu- 
merically, shown in Table ^. Previous results obtained for 
single-shot measurements on a pulse containing n, or n 
on average, photons are also shown for comparison. 

A number of regularities are evident from this table. 
With coherent light, the variance reduction offered by 
adaptive measurements is at most a multiplying factor. 
With nonclassical light, nonadaptive measurements scale 
in the same way as for coherent light, but adaptive mea- 
surements offer an improvement in the scaling. In all 
cases, the variance reduction (by a change in the prefac- 
tor or the scaling) is less in the cw case than in the pulsed 
case. This is because in order to obtain the best phase 
estimate, as N increases, the memory time for the esti- 
mate is reduced. This is needed to keep the contribution 
to the variance from the varying system phase (which in- 
creases with memory time) comparable with that from 
the quantum uncertainty (which decreases with memory 
time) . This means that the effective number of photons 
used for the estimate is the number per memory time, 
rather than the number per coherence time, N. 
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TABLE L Scaling of phase variances for large photon numbers A*' under various measurement conditions. For the cw (continuous- 
wave) cases, A'' is the number of photons per coherence time. In the pulsed cases, n (n) is the (mean) photon number per pulse. 
Dyne measurements are those performed on a phase-shifted beam or pulse using a local oscillator, while MZI measurements 
are of a phase shift in one arm of a Mach-Zehnder interferometer. The two empty cells are those not treated in this study, and 
the question mark denotes a conjectured scaling. 

Coherent, dyne Squeezed, dyne Coherent, MZI Optimal, MZI 
cw, adaptive A-^/^/S 0(Ar-2/3) jy-i/a 
cw, nonadaptive N-^''' N'^''^ x i'''^ /2 N'^''^ 
Pulsed, adaptive n~^/4, 0(lnn/n^) n"^ 0(lnn/n^)? 

Pulsed, nonadaptive n~^/2 n~^/4 0{n~^) 



In the case of dyne measurements on a coherent beam, 
it was found that good results were obtained using a sim- 
ple feedback phase (arg At), similarly to mark II single- 
shot measurements B . In the cw case, the feedback sim- 
plifies to a form even simpler than for the single-shot 
case. Specifically, the feedback phase is simply adjusted 
proportional to the photocurrent. When the correct pro- 
portionality constant is selected, a minimum equilibrium 
phase variance is found that scales as N~^/^ /2. This is 
only times smaller than the phase variance for het- 
erodyne measurements. 

For the case of dyne measurements on broadband 
squeezed states, the situation is considerably more com- 
plicated. The change in the phase cannot be taken to 
be proportional to the current, but rather is a functional 
with two parameters. With the degree of squeezing to be 
optimized as well, there are three parameters that must 
be varied to find the minimum phase variance. Neverthe- 
less, it is still possible to obtain an analytic result that 
agrees with the numerical results in its scaling (although 
predicts the wrong multiplying factor). Specifically, it 
was found that the minimum phase variance varies as 
N~^^^, compared to iV"^/^ for a coherent beam. This 
contrasts with heterodyne measurements on broadband 



squeezed states, for which the minimum variance is only 
about 7% below the corresponding result for a coherent 
beam. 

The case for interferometry is more difficult to treat, 
as it does not work with any simple feedback scheme. 
The feedback used was based on minimizing the expected 
variance after the next detection, similarly to the single- 
shot case. Despite this, it was found that it is possible to 
determine an approximate theory that agrees reasonably 
well with the numerical results for the case where a coher- 
ent beam enters one port of the interferometer. Similarly 
to the dyne case with a coherent state, the phase vari- 
ance is proportional to N^^^^. When a linearly changing 
feedback phase was used (analogous to the heterodyne 
scheme), it was found that the phase variance is above 
that for the adaptive feedback, but the difference is very 
small except for N of order unity. This is as can be 
expected, as the difference is also very small for large 
photon numbers in the single-shot case. 

In comparison with our previous pulsed results, the 
cw results obtained in this paper are probably more rel- 
evant to, and in some cases easier to implement in, a 
quantum-optics laboratory. This augurs well for future 
experimental verification. 
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